PUTTING LANGUAGE INTO LANGUAGE MODELINGy

نویسندگان

  • Frederick Jelinek
  • Ciprian Chelba
چکیده

In this paper we describe the statistical Structured Language Model (SLM) that uses grammatical analysis of the hypothesized sentence segment (prefix) to predict the next word. We first describe the operation of a basic, completely lexicalized SLM that builds up partial parses as it proceeds left to right. We then develop a chart parsing algorithm and with its help a method to compute the prediction probabilities P (wi+1jWi): We suggest useful computational shortcuts followed by a method of training SLM parameters from text data. Finally, we introduce more detailed parametrization that involves non-terminal labeling and considerably improves smoothing of SLM statistical parameters. We conclude by presenting certain recognition and perplexity results achieved on standard corpora.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Thematic and Structural Analysis of Sepideh Kashani’s Poetry

This article discusses the structural and thematic aspects of poetry by Sorour Azam Bakouji, known as Sepideh Kashani. Her poetry contains not only revolutionary poems before the Islamic Revolution in Iran but also those about imposed war on Iran by Iraq. Regarding this fact, the writer of the current study tries to discuss which types of genres, figures, images, as well as language have been u...

متن کامل

The Effect of Social Network Use on EFL Learners’ Second Language Achievement: An Investigation into their Attitudes

The efforts were made in the present study to seek two objectives: determining the effect of Telegram as a social network on second language achievement of Iranian foreign language (EFL) learners, and exploring the EFL learner’ attitude toward using Telegram for language learning purposes. To this end, 40 EFL learners were randomly selected and then divided into two groups of experimental and c...

متن کامل

Iranians’ Belief about Language Learning: The Role of Sex and Language Proficiency

The purpose of this study was to see the role of English language proficiency level and sex on Iranian students’ beliefs about language learning. This study also investigated the usefulness of the BALLI questionnaire (Horwits, 1988), which checks learners’ beliefs, for the context of Iranian English language learners through conducting an interview. A total of 171 Iranian learners from safir in...

متن کامل

Bound Morpheme Frequencies in the Performance of Iranian English Language Undergraduates and English Language Materials Developers in Written Descriptive Tasks

This mini-corpus, cross-linguistic, comparative, and norm-referenced study intends to render the most frequently and oft-used affixes in the written descriptive tasks in the performance of English language materials developers (ELMDs) and Iranian English language undergraduates (IELUs). Samples of writings of both groups were studied and analyzed through affixation principles. The frequency of ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999